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The modularity of a network quantifies the extent, relative to a null model network, to which 
vertices cluster into community groups. We define a null model appropriate for bipartite networks, 
and use it to define a bipartite modularity. The bipartite modularity is presented in terms of a 
modularity matrix B; some key properties of the eigenspectrum of B are identified and used to 
describe an algorithm for identifying modules in bipartite networks. The algorithm is based on 
the idea that the modules in the two parts of the network are dependent, with each part mutually 
being used to induce the vertices for the other part into the modules. We apply the algorithm to 
real-world network data, showing that the algorithm successfully identifies the modular structure of 
bipartite networks. 
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I. INTRODUCTION 

Networks have attracted a burst of attention in the last 
decade (useful reviews include Refs. [1-4]), with applica- 
tions to natural, social, and technological networks. Of 
great current interest is the identification of the modular 
structure of the network. Detecting modules, or commu- 
nities, allows quantitative investigation of relevant sub- 
networks, which may have different properties from the 
aggregate properties of the network as a whole, e.g., mod- 
ules in the World Wide Web are sets of topically related 
web pages. 

Informally, a network module is a subgraph whose ver- 
tices are more likely to be connected to one another than 
to the vertices outside the subgraph. A variety of ap- 
proaches [5-13] have been taken to explore this concept. 
See Refs. [14, 15] for useful reviews. 

In this work we focus on the measure called modular- 
ity, introduced by Newman and Girvan [11]. Modularity 
reflects the extent, relative to a null model network, to 
which edges are formed within modules instead of be- 
tween modules. Using the modularity, we can assess the 
quality of any assignment of vertices to modules. Further, 
the module identification problem becomes a modularity 
optimization problem. However, exact maximization of 
the modularity is in general an intractable problem, be- 
cause the number of ways to partition the set of vertices 
grows extremely rapidly [16]. In light of this, a num- 
ber of effective algorithms have been introduced to find 
high modularity partitions of the vertices [17, 18]. The 
modularity can be also be defined in terms of a so-called 
modularity matrix, the eigenspectrum of which has a fun- 
damental relationship with the modular nature of the 
network [19]. 

Given the explicit dependence of the modularity upon 
a null model, it is clear that the specific choice of null 
model has a profound impact on the modularity. Sur- 
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prisingly, only one null model has been so far explored at 
length: networks with edges randomly assigned such that 
the expected degrees of model-network vertices equal 
the actual degrees of corresponding real-network vertices 
[19]. Specific classes of networks have additional con- 
straints that could be and, indeed, should be reflected in 
the null model. 

A significant such class of networks is that of bipartite 
networks. The vertices of a bipartite network can be par- 
titioned into two disjoint sets such that no two vertices 
within the same set are adjacent. There are thus two dis- 
tinct kinds of vertices, providing a natural representation 
for many afhliation or interaction networks, with one kind 
of vertex representing actors and the other representing 
relations. Examples of actor-relation pairs include peo- 
ple attending events [20-22], court justices making de- 
cisions [22], scientists jointly publishing articles [23, 24], 
organizations collaborating in projects [25, 26], and leg- 
islators serving on committees [27]. Arguably, bipartite 
networks are the empirically standard case for social net- 
works and other interaction networks, with unipartite 
networks appearing — often implicitly — as projections. 

In the statistical physics community, the usual ap- 
proach taken to identify modules in bipartite networks 
is to first construct a unipartite projection of one part of 
the network, and then identify modules in that projec- 
tion using methods for unipartite networks. For example, 
in the scientist-publication network mentioned above, a 
network of scientists is created by linking scientists when 
they have jointly published. These unipartite projection 
can be illuminating, but intrinsically lose information — 
indeed, Guimera et al. [28] demonstrate that analysis of 
an unweighted, unipartite projection can give unreliable 
or incorrect results. 

The principal contribution in this work is a proposed 
definition of a modularity for bipartite networks. The ap- 
proach taken is based on defining a bipartite modularity 
matrix B as an extension of the recent work by Newman 
[19]. Some key properties of the eigenspectrum of B are 
identified and used to specialize Newman's matrix-based 
algorithms to bipartite networks. An additional algo- 



2 



rithm fundamentally based on the bipartite character of 
the networks is introduced; we call the algorithm BRIM, 
for bipartite, recursively induced modules. 

In parallel, Guimera et al. [28] have independently in- 
vestigated modularity in bipartite networks. They pro- 
ceed by first identifying the two parts of the network as 
actors and teams, and then formulating a bipartite mod- 
ularity in which modules consist of groups of actors that 
arc closely interconnected based on joint participation in 
many teams. The resulting modularity is thus focused on 
identifying modules in only one part of the network at a 
time. Interesting, Guimera et al. point out the possibili- 
ties of classifying both partite sets of the network simul- 
taneously and of customizing spectral methods for bipar- 
tite networks, which is essentially the approach taken in 
the present work. 

As of this writing, we are aware of no other attempts to 
define modularity for bipartite networks. However bipar- 
tite networks, or "two mode networks," have undergone 
several related studies in the sociology community us- 
ing other methods (see, e.g., Refs. [21, 22] and references 
cited therein). 

The structure of the paper is as follows: in section II 
we define a modularity matrix and measure for bipar- 
tite networks. We discuss using the bipartite modularity 
matrix to identify modules in section III, and apply the 
algorithm therein devised to two real-world networks in 
section IV. Finally, we conclude in section V with an as- 
sessment of the present investigation and an outlook for 
future work. 



II. BIPARTITE MODULARITY 

In this section, we develop a modularity matrix for 
bipartite networks. Structurally and notationally, the 
development parallels the discussion of the modularity 
matrix by Newman [19]. 

Consider a network with n vertices and m edges de- 
fined by an adjacency matrix A. Each vertex i is as- 
signed to a community group or module, denoted by gi. 
The modularity Q for such an assignment reflects the ex- 
tent, relative to a null model, to which edges are formed 
within modules instead of between modules. Formally, 
the modularity is defined as 

<3 = ^ I] i^ij - Pij) ^ i9i,9j) , (1) 
i,j 

where the Aij are the adjacency matrix elements and the 
Pij are probabilities in the null model that an edge exists 
between vertices i and j. 

The modularity can be given an equivalent definition 
in matrix form. First, the n community indices gi with 
values taken from {1, 2, . . . , c} are replaced by an n x c 
index matrix S = [si | S2 | • • • | Sc] , where c is the number 
of modules. All elements of S take on either a or 1 value, 
so that column Sj is an index vector showing membership 



in module i: a value of 1 in position j of Sj indicates that 
vertex j belongs to module i. Given that each vertex is 
assigned to exactly one module, each row of S has a single 
unit value and the index vectors are thus orthogonal. 

Further, a modularity matrix B is defined with ele- 
ments 

Bij = Aij — Pij . (2) 
Using S and B, the modularity becomes 

g = ;r^TVSTBS . (3) 
2m 

The eigenspectrum of B has a fundamental relationship 
with the modular nature of the network, as Newman [19] 
has explored. 

From Eqs. (1) through (3), it is apparent that the 
choice of null model has a profound impact on the modu- 
larity. Thus, for example, a Bernoulli random graph with 
constant Pij = p for all i and j is a poor representation 
of most real- world networks, so would be an inappropri- 
ate choice of null model. Instead, the usual choice of null 
model [19] assigns edges at random with the expected de- 
grees of model vertices constrained to match the degrees 
in the actual network. 

In much the same fashion, bipartite networks have 
specific constraints that should be reflected in the null 
model. The vertices of a bipartite network can be par- 
titioned into two disjoint sets such that no two vertices 
within the same set are adjacent. An equivalent, but 
more visual, deflnition is that the vertices in a bipartite 
graph can be assigned one of two colors, say red and blue, 
with no neighboring vertices bearing the same color. In 
the remainder of this section, we will deflne a null model 
with the above requirement that the expected degrees 
match the degrees in the real network, along with the 
additional constraint that each edge links a red vertex 
and a blue vertex. 

Let p be the number of red vertices and q be the num- 
ber of blue vertices; this implies n = p + q. Without loss 
of generality, assume that the vertices are indexed so that 
red vertices are labeled 1, 2, . . . ,p and the blue vertices 
arc labeled p + \,p + 2, . . . ,p + q. The adjacency matrix 
then has a block off-diagonal form of 

n A 

y.-^ )qy.p ^qxq 

where Ojxj is the all-zero matrix with i rows and j 
columns. Require the same block structure for P that 
is exhibited by A, giving 

~ (P'^) o ■ ^ ' 

\^ )qxp '-'qxq _ 

This form for P assigns zero likelihood to edges between 
vertices with the same color, precluding any such edges 
in the null model. 
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The modularity matrix B in turn has a block off- 
diagonal form of 



B 



O 



B 



pxq 



(BT) 



qxp ^qxq 



(6) 



where B = A — P. The all-zero blocks on the diago- 
nal are the potential modularity contributions from pairs 
of vertices of the same color being present in a mod- 
ule; all meaningful contributions, positive or negative, 
to the modularity thus are made by pairs of vertices 
with distinct colors. In contrast, with the usual null 
model based on unipartite networks [19], the correspond- 
ing blocks contain only negative elements (or zeros for 
isolated nodes of degree zero), always providing a modu- 
larity penalty for pairs of like-colored vertices in the same 
module. 

Equation (1) can be rewritten as 



^ p 1 



(7) 



i=l 3=1 



where hj = gj+p- Since Q = when all vertices are in 
the same module, we can set all 5, and hj equal, giving 



p g 



so that 



E E 

1 j=l i=l j=l 



m 



(8) 



(9) 



Thus, the expected number of edges in the null model 
must equal the number of edges in the actual network. 

The degrees of the red vertices are given by 
Y^'j=i ^ij ~ while those of the blue vertices are given 

by J2^=i ^ij — ■ constraining the expected degrees 
in the null model to match the actual degrees, as dis- 
cussed above, we obtain 



^^Pjj — dj 



Since 



m 



i=l 
Q 



Y^dj = m , 
Eqs. (10) and (11) ensure that Eq. (9) holds. 



(10) 
(11) 

(12) 
(13) 



In the usual null model, the probability of an edge 
being present between two vertices is proportional to the 
product of the degrees of the vertices. For the bipartite 
case, this becomes Pij = Ckidj for some constant C. 
Combining this definition with Eqs. (11) and (12), we 
obtain 



di 



i=l 



1=1 
= (Cm) dj 

so that C = 1/m and thus 



p = 



m 



(14) 



(15) 



The same result can be obtained from Eqs. (10) and (13) 
instead of Eqs. (11) and (12). With Eq. (15), we have 
fully defined the modularity Q for a bipartite network. 



III. MODULE IDENTIFICATION 
A. Spectral Methods for Module Identification 

Using the modularity defined in section II, we can as- 
sess the quality of any partitioning of the vertices of a bi- 
partite graph into modules. A partitioning can be deter- 
mined using any method. Two general approaches seem 
relevant. First, the modularity defined in section II can 
be maximized using standard optimization algorithms 
such as genetic algorithms, greedy search methods [18], 
or extremal optimization [29]; this is generally straight- 
forward and will not be discussed at length in this work. 
Second, the spectral properties of B or other matrices 
associated with the graph can be analyzed to partition 
the vertices into modules. 

For example, one standard partitioning approach is to 
assign the vertices to modules using spectral partition- 
ing (SP). In spectral partitioning, the eigenvectors of the 
network Laplacian are used to minimize the number of 
edges running between groups. The SP approach has a 
significant drawback: the vertices are assigned to mod- 
ules of predetermined size. This is problematic for the 
investigation of real-world networks, where the number 
and sizes of community groups are not generally known 
in advance. 

An analogous approach based on the spectral proper- 
ties of the modularity matrix B has recently been pro- 
posed [19]. Since the modularity is conceptually closer 

to our understanding of network community structure, 
this spectral optimization of modularity (SOM) is better 
tailored for real world networks. 

An important special case in both spectral partition- 
ing and spectral optimization of modularity is to assign 
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the vertices to two groups based on a single eigenvector 
of the Laplacian (SP) or modularity (SOM) matrix. In 
the case of SP, we arc interested in the eigenvector corre- 
sponding to the smallest positive eigenvalue; this is the 
Fiedler vector. For SOM, we are interested in the lead- 
ing eigenvector x, corresponding to the largest positive 
eigenvalue A of B; we propose calling this the Newman 
vector. Using the Newman vector, wc approximate B as 

B « Axx^ . (16) 

[si I S2], so that the modu- 



With just two modules, S 
larity in Eq. (3) becomes 



(17) 



Recall that the index vectors si and S2 take on values 

from {0, 1}. It is clear how to maximize the modularity 
in Eq. (17); when Xi, the ith element of x, is positive, 
assign vertex i to the first module by setting the ith entry 
of si to one, and when Xi is negative, assign vertex i to 
the second module by setting the ith entry of S2 to one 
[37]. 

The use of multiple of eigenvectors allows more than 
two modules to be considered (c > 2), with at most one 
module more than the number of positive eigenvalues of 
B [19]. Additional eigenvectors of B can also be used 
for SOM [19] in a vector partitioning algorithm adapted 
from spectral partitioning [30, 31]. In the present work, 
we will not make use of this algorithm, nor of a recursive 
bipartitioning approach, instead developing an alterna- 
tive technique that capitalizes on the bipartite nature of 
the networks. 



B. Module Identification in Bipartite Networks 

In section III A, we have seen how to identify commu- 
nity groups of networks by using the Newman vector to 
maximize Q. However, we made no use of the bipartite 
character of the networks. For a bipartite network, the 
eigenvalue equation Bxj = AjXj can be written as 






B 
























(18) 



where Uj is a p x 1 vector and is a g x 1 vector. The 
left-hand side of Eq. (18) can be multiplied out, giving 



(19) 



i.e., Bvj = AjUj and B'^u, = A,v,. 

Additionally, we can construct a vector from Uj and 
— Vj, so that 



• (20) 
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Hence, for any eigenvalue Aj of B, — Aj is also an eigen- 
value of B. 

Since only the eigenvectors corresponding to positive 
eigenvalues of B can give positive contributions to Q, we 
can focus on just the positive eigenvalues Ui = |Ai| > 0. 
In this case, and v^ are, respectively, left and right 
singular vectors of B. If we shift our attention from the 
spectral decomposition of B to the singular value decom- 
position (SVD) of B, wc therefore automatically exclude 
the eigenvectors of B that correspond to negative eigen- 
values. 

The appearance of the singular vectors of B is not sur- 
prising. All the information about the linkage structure 
of the network is contained in B, and the singular value 
decomposition is the natural generalization of the spec- 
tral decomposition used for B to asymmetric matrices 
like B. What is more, the singular values and singu- 
lar vectors of B can sometimes provide more information 
than the eigenvalues and eigenvectors of B. 

For example, the number of modules is at most one 
more than the number of positive eigenvalues of B. Since, 
for each vertex, the expected degree in the null model 
equals the actual degree in the network, the rows and 
columns of B all sum to zero. The rank r of B, which 
equals the number of singular values of B, must then be 
less than both p and q. From this, we conclude that the 
number of communities is at most equal to the smaller 
of p and q. 

To assign vertices to modules using B, we first parti- 
tion the index matrix S so that 



R 
T 



(21) 



The matrices R and T have dimensions p x c and q x 

c, respectively, indexing the red and blue vertices into 
c modules. Substituting the partitioned matrices into 
Eq. (3), we obtain 



Q 



1 



Tr R^BT 



(22) 



Our goal then becomes to assign network vertices to mod- 
ules such that Eq. (22) is maximized. 

One approach to optimizing the modularity as ex- 
pressed in Eq. (22) is essentially the same as the New- 
man vector approach considered in section III A. With- 
out loss of generality, label the singular values such that 
ci > (72 > CTr > 0. Approximate B as 



B K, (TlUlVi 



(23) 



Now, we bipartition the vertices with R = [ri j r2] and 
T = [ti I t2], so that 

g=^((ri,ui)(ti,vi) + (r2,ui)(t2,vi)) . (24) 

As with the Newman vector approach, Q is maximized 

by assigning the vertices to modules based on the signs of 
the corresponding component of Ui or vi , as appropriate. 
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This maximizes the magnitude of the inner products in 
Eq. (24), with consistent assignment of both red and blue 
vertices to the same module based on the signs ensuring 
that positive contributions are made to the modularity. 



C. Recursive Identification of Bipartite Modules 

In sections III A and IIIB, we have seen how the lead- 
ing eigenvector of B and the leading singular vectors of 
B can be used to bipartition network vertices. Extending 
these methods to use the full modularity matrices and to 
handle more than two modules is in general nontrivial. 
However, for the bipartite case at least, there is a rel- 
atively straightforward extension that leads to a useful 
algorithm. 

First, we assume that the blue vertices are all assigned 
to modules through some mechanism. Maximizing the 
modularity then consists solely of assigning the red ver- 
tices to modules. This is a comparatively simple task. 
To see this, rewrite Eq. (22), giving 



Q = — Tr R^BT 

m 



-Tr R^T 



m 



(25) 



where we have aggregated the fixed terms into the matrix 
T = BT. We now write Eq. (25) in terms of explicit 
sums, so that 



c P 



Q — — RikTik 

k=\ i=l 



(26) 



The inner sum in Eq. (26) is a sum across the rows of 
R. Since each row of R consists of a single 1 with all 
other elements being 0, the modularity is now simple to 
maximize: we just assign red vertex % to module k such 
that Tife is the maximum of the ith row of T [38] . 

Conversely, if the red vertices are all assigned to mod- 
ules, maximizing Q consists of assigning the blue vertices 
to modules. Analogously to the previous case, we define 
R = B"'"R and manipulate Eq. (22) into the form 

^=\X.\il^^^^A ■ (27) 
j=i \k=i / 

As with the red vertices, we maximize Q by assigning 
the jth blue vertex to the module k such that Rjk is the 
maximum of the jth row of R. 

Taken together, these two maximization procedures 
define an algorithm that we call BRIM (bipartite, re- 
cursively induced modules). The BRIM algorithm is an 
iterative algorithm for maximizing Q, with the sets of 
red and blue vertices each recursively drawing the other 



into modular structures. For each iteration, Q is guaran- 
teed never to decrease, as it is always possible at least to 
maintain the previous vertex partitioning and keep the 
modularity the same. Therefore, the BRIM algorithm 
will always find a partition at a maximum of Q. In gen- 
eral, the identified partition will correspond to a local 
maximum in Q, not the global maximum. 

Note that the BRIM algorithm can work with the en- 
tire B matrix, or a rank-restricted approximation calcu- 
lated by omitting the smallest singular values. By using 
the full B matrix, we automatically include all positive 
contributions to the modularity. As well, the algorithm 
can work with any assumed number of modules; how- 
ever, no constraint exists to ensure that each module is 
occupied. 

To test the efficacy of the BRIM algorithm, we apply 
it to a simple model network. The model consists of 
Njnod modules, each containing A^red red and A^tiue blue 
vertices. An edge exists between a red vertex and a blue 
vertex with probability pi^ if they are in the same module 
and with probability Pout if they are in different modules. 
No edges exist between vertices with the same color. 

The qualitative behavior of the model depends on 
and Pout- When pin > Pout, there is a greater proba- 
bility of vertices within a module being linked than ver- 
tices in different modules, matching our intuitive notion 
of modularity. With pin sufficiently close to one and Pout 
small, the actual modular structure of a particular real- 
ization of the model should correspond to the assumed 
modular structure. As Pout Pin, the network becomes 
more uniform, with the assumed modular structure ul- 
timately vanishing and all vertices belonging to a single 
module. [39] Lower values of pin introduce additional sub- 
structure into the modules; the general behavior as pout 
varies should be similar to the previous case, but with 
an overall reduced correspondence between the assumed 
modules and the actual modules in networks instantiated 
from the model. 

Following Danon et al. [14], we make precise the above 
qualitative description in terms of the normalized mu- 
tual information /norm- Consider two schemes X and Y 
for dividing the n vertices into community groups, repre- 
sented by two index matrices Sx and Sy [40]. The two 
index matrices are used to calculate the so-called confu- 
sion matrix N, which takes the simple form 



N = SJcSy 



(28) 



The probability P {X = x,Y = y) that a vertex is as- 
signed to community x in scheme X and to community y 
in scheme Y is proportional to the corresponding element 
Nxy of the confusion matrix, so that 



P {X = x,Y = y) = -N^y 



(29) 



Using the probability as defined in Eq. (29), we can cal- 
culate the normalized mutual information as 



XX,Y) = 



'2nX,Y) 
H{X)+H (Y) 



(30) 
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Equation (30) is expressed in terms of the usual mutual 
information / {X, Y) and entropies H (X) and H (X) 
[32], defined as 

IiX,Y) = Ei'(^,i^)log;p^|^ (31) 
HiX) = -J2P{X)logPiX) (32) 

X 

H{Y) = -EF(y)logP(r) . (33) 

V 

In Eqs. (30) through (33), we have made use of the 

common shorthand abbreviations P {X = x^Y = y) = 
P (X, r), P (X = x) = P (X), and P{Y = y) = P {Y). 
The base of the logarithms in Eqs. (31) through (33) is 
arbitrary, as the computed measures only appear in the 
ratio in Eq. (30). 

The normalized mutual information is a measure of the 
amount of information common to the two partitioning 
schemes. By taking one of the partitions to be the as- 
sumed modular structure of the network and one to be 
the structure found using the BRIM algorithm, we can 
thus explore the efficacy of the algorithm. When the 
found modules match the real ones, we have /norm = 1, 
and when the found modules are independent of the real 
ones, we have /norm = 0. 

We now set TVmod = 5, iVred = 12, and A^biue = 8, giv- 
ing n = 100 vertices in the network. With various choices 
of Pill and Pout, wc repeatedly instantiate the model net- 
work and determine the assignment of vertices to mod- 
ules using the BRIM algorithm. The algorithm is initial- 
ized by assigning each of the blue vertices to a unique 
module. For each sample, we calculate /norm- 

In Fig. 1, we show results of applying the BRIM algo- 
rithm to the model network. The points show the mean 
value of /norm, avcragcd over 100 instantiations of the 
network. The error bars show the standard error of the 
mean. The general behavior is as anticipated, lending 
confidence to the algorithm definition. 

D. Determining the Number of Modules 

The BRIM algorithm is silent on the issue of how many 
modules should be used. As noted in section IIIB, the 
number of modules c is at most one more than the rank 
of B, which is a relatively weak constraint. One approach 
is thus to assign each vertex of the smaller of the red and 
blue vertex sets to unique modules, and allow the ver- 
tices to be grouped into an appropriate number of mod- 
ules. For the BRIM algorithm, said approach is resource 
intensive, requiring the calculation of modularity contri- 
butions for what may be a grossly overestimated number 
of modules. Worse still, when the number of vertices is 
much greater than the number of modules, the BRIM al- 
gorithm may terminate at low-quality local maxima far 
from the true number of modules in the network (see 
section IV B for an example of this) . 
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FIG. 1: Agreement between model network modules and 
modules found using the BRIM algorithm. Each point shows 
the mean normalized mutual information between the model 
network community groups and those identified using the al- 
gorithm, averaged over 100 realizations of the model network. 
Error bars show the standard error of the mean. 

Clearly, automatically selecting the correct number of 
allowed modules in such a case would be preferable. The 
allowed number of modules c thus becomes an adapt- 
able parameter for which a value is to be found that op- 
timizes the modularity. This presents some difficulties 
in that there is no obvious relationship between the al- 
lowed number of modules and the modularity found by 
the BRIM algorithm. However, by assuming that the 
modularity depends on the allowed number of modules 
in a reasonably smooth fashion, we can use a simple bi- 
section approach to identify an appropriate value for the 
number of allowed modules. 

The search begins by requiring all vertices to belong 
to the same module, c = 1, giving Q = 0. We double 
the allowed number of modules c. Half of the vertices are 
randomly reassigned to the newly defined modules, and 
a new, locally optimal solution is found using the BRIM 
algorithm. This process continues, with c being repeat- 
edly doubled so long as Q continues to increase. Each 
step in the c-search builds on the previous solution by 
partially reusing the assignment of vertices to modules. 

Once Q drops as c increases, we have crossed a maxi- 
mum in the modularity landscape. We therefore switch 
from extrapolating to larger numbers of modules to inter- 
polating within the interval that includes the maximum. 
The interpolation is done using a simple bisection search 
in the allowed number of modules, trying new values for 
c so as to continuously reduce the interval wherein the 
putative maximum in Q lies. As with the initial extrap- 
olation stage of the search, vertices are assigned from 
earlier solutions to the newly allowed modules for each 
value of c, and a new, locally optimal solution found. 

The search for c terminates once the interval becomes 
sufficiently small. In this work, we take the interval to 
be 2, i.e., the Q maximum at c = Cmax is bracketed by 
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inferior solutions at c = Cmax — 1 and c = Cmax + 1- 
This adaptive BRIM algorithm enables us to identify the 
appropriate number of modules Cmax in a number of steps 
that scales logarithmically with the number of vertices in 
the network. 



IV. RESULTS 

In this section, we apply the BRIM algorithm to a net- 
work showing the interactions of women in the American 
Deep South at various social events [20] and to a network 
showing corporate interlocks in Scottish firms [33]. Both 
networks are conveniently available on the World Wide 
Web in Pajek format [34]. 

A. Southern Women Event Participation 

As an initial example, we consider the Southern women 
data set, collected by Davis et al. [20] in and aroimd 
Natchez, Mississippi during the 1930s as part of an ex- 
tensive study of class and race in the Deep South. This 
data set and networks derived from it have been much 
studied. Indeed, Freeman [21] has described it as "... a 
touchstone for comparing analytic methods in social net- 
work analysis." 

The Southern women data set describes the participa- 
tion of 18 women in 14 social events. The women and 
social events constitute a bipartite network; an edge ex- 
ists between a woman and a social event if the woman was 
in attendance at the event. The network is connected. 

We identified network modular structure using the 
BRIM algorithm. The initial state is in general impor- 
tant. The dependence on the initial state is most visible 
in the quality of the stable solution, i.e., the algorithm 
can get "stuck" at a poor quality local maximum. We 
initialized the assignment of events to modules in T us- 
ing several strategies: (1) assigning all events to a single 
module, (2) assigning each event to its own module, and 
(3) randomly assigning events to modules. 

For this network, all three strategies identify modular 
structures. The first strategy produces a good quality 
solution (4 modules, Q = 0.34554). The second strategy 
also produces a solution that captures a great deal of the 
modular structure, but is somewhat coarser than the first 
(2 modules, Q = 0.32117). The third strategy, random 
initial assignment, sheds light on the quality of the first 
two. Because the network is small, a large number of 
trials can be run without difficulty; we ran 500,000 tri- 
als. The greatest modularity found equalled that found 
with all events initially in unique modules, Q = 0.34554, 
indicating that this best solution found is quite good. 

In Fig. 2, we show the best assignment of vertices to 
modules determined using the BRIM algorithm with all 
events initially in different modules. The shapes of the 
vertices show which ones belong to the same modules, 
with four modules in all. Open symbols with black labels 



portray vertices corresponding to the women, and filled 
symbols with white labels portray vertices corresponding 
to the events. The positions of the vertices arc based 
on the singular vectors corresponding to the two largest 
singular values of B, with the right singular vectors giv- 
ing the coordinates for the events and the left singular 
vectors giving the coordinates for the women. Several 
vertices have been shifted slightly to prevent overlapping 
vertex symbols while preserving the overall character of 
the network. 




FIG. 2: Modules in the Southern women network. The women 
are represented as open symbols with black labels and the 
events as filled symbols with white labels. The modules arc 
indicated by the shape of the symbols. Vertices are positioned 
with coordinates based on the elements of the singular vectors 
corresponding to the two largest singular values of B; some 
vertices are repositioned slightly to eliminate overlaps. The 
vertex partition pictured has the highest modularity we have 
found for the Southern women network, Q = 0.34554 

The community groups found using the BRIM algo- 
rithm are comparable to those found in previous inves- 
tigations of the Southern women data set (Ref. [21] pro- 
vides a useful survey) . Most such studies have focused on 
the women, leaving the groupings for the events unspeci- 
fied; we can use the groupings of the women to assign the 
events to the best modules, as described in section IIIC, 
and calculate modularity values for purposes of compar- 
ison. The community groups can be further compared 
using the normalized mutual information between the 
various groupings of the women and the best grouping 
found using the BRIM algorithm. Values of Q and /norm 
are summarized in table I and discussed in depth below. 

In the original investigation, Davis et al. [20] used gen- 
eral ethnographic knowledge of the community to as- 
sign the women to two groups. The groups consisted of 
women 1-9 and of women 9-18; woman 9 is a secondary 
member of both groups. To bo consistent with the def- 
initions in section II, we must assign this individual to 
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TABLE I: Comparison of modules in the Southern women 
network. Where necessary, the modularity values Q are calcu- 
lated from an optimistic assignment of the events to the best 

possible modules from a given assignment of the women to 
modules. Values of the normalized mutual information /norm 
are calculated between the given divisions of the women and 
the best division found using the BRIM algorithm. 

Modules Q /norm 



BRIM 
Spectral 
Davis 1 
Davis 2 
Doreian 
Unipartite 



0.34554 
0.32117 
0.31057 
0.31839 
0.29390 
0.21866 



1 

0.56897 
0.44657 
0.45126 
0.60766 
0.28019 



jection. Wc first use the Newman vector to partition the 
women into two groups as described in section III A, with 
women 2 and 4-7 in one group and all others in a sec- 
ond group. Next, we determine the best assignment of 
events to modules using the approach from section IIIC. 
Together, this gives the values shown in table I for the 
label "Unipartite," which reflect that some of the mod- 
ular structure of the network has been captured but is 
generally inferior to the solutions found from the bipar- 
tite network. Further, the solution from the unweighted 
projection does not correspond to a maximum in the bi- 
partite modularity; using the solution as the initial state 
for the BRIM algorithm, a solution is obtained with two 
modules identical to those found using spectral biparti- 
tioning as described in section IIIB. 



a specific group. The Q and /norm values are seen from 
table I to be similar for both assignments, with the case 
where woman 9 is grouped with women 10-18 labeled as 
"Davis 1" and the case where woman 9 is grouped with 
women 1-8 labeled as "Davis 2." The latter division is 
the same as what Freeman [21] identified as the consensus 
from 21 different studies of the Southern women data set. 
The Q and /norm values are reasonably similar to values 
found for two modules using either the BRIM algorithm 
or spectral bipartitioning as discussed in section IIIB, 
which groups the women into sets {1-7, 9} and {8, 10- 
18} (identified in table I with the label "Spectral"). 

Doreian ct al. [22] considered the modular nature of 
both parts of the network, suggesting several divisions 
of the women and events. The division with the great- 
est modularity (given in their Table 4) is characterized 
in table I with the label "Doreian." Taking just their 
partitioning of the events into three groups (events 1-5, 
6-9, and 10-14) and replacing their partitioning of the 
women using the approach from section IIIC, the mod- 
ularity can be increased from 0.29390 to 0.32950. This 
is similar to the best assignment of vertices to modules 
we described above, with modularity of 0.34554, wherein 
the additional structure produces a modest, but real, im- 
provement in the modularity. 

It is also of interest to compare the community groups 
obtained for the Southern women network using the bi- 
partite network to those found using an unweighted pro- 
jection network. Here, we focus on the projection con- 
sisting of the eighteen women as vertices, with edges de- 
fined by mutual participation in events. The best divi- 
sion we found for the women, discussed above and shown 
in Fig. 2, actually has a negative value for the standard 
unipartite modularity; it is thus better to use only a sin- 
gle module containing all eighteen women than the best 
module found for the bipartite network. Since the mod- 
ules we identified from the bipartite network using the 
BRIM algorithm are similar to those found in numerous 
other studies, this highlights the difficulties that can arise 
using a unipartite projection. 

Conversely, we can determine the bipartite modularity 
for community groups found using the unipartite pro- 



B. Scotland Corporate Interlock 

As a second example, we consider a data set on corpo- 
rate interlocks in Scotland in the early twentieth century 
[33]. The data set characterizes 108 Scottish firms dur- 
ing 1904-5, detailing the corporate sector, capital, and 
board of directors for each firm. The data set includes 
only those board members who held multiple director- 
ships, totaling 136 individuals. 

Here, we focus on the bipartite network of firms and 
directors, with edges existing between each firm and 
its board members. Unlike the Southern women net- 
work, the Scotland corporate interlock network is not 
connected. In the following, we consider only the largest 
component of the graph, containing 131 directors and 86 
firms — and thus, as many as 86 modules. 

As with the Southern women network, assigning all di- 
rectors to unique modules or to the same module results 
in a solution that captures some of the modular character 
of the network, with Q = 0.56634 and Q = 0.39873, re- 
spectively. However, in contrast to the Southern women 
network, these are rather poor solutions to what can be 
found starting from a random assignment of directors to 
modules (see Fig. 3). 

Further, the best solutions are found by restricting 
the allowed number of modules c to less than the max- 
imum. In principle, allowing the number of modules to 
take on any size leaves the BRIM algorithm to search 
the largest possible space, potentially finding the largest 
possible modularity value. In practice, the results are 
inferior to those obtained from a more restricted search. 
In Fig. 3, we show the results, in terms of the actual 
numbers of modules occupied and modularity values, for 
BRIM searches with the allowed number of modules re- 
stricted. This trades off the possibility of higher modular- 
ity values in the excluded region for improved searching 
in the remaining region. The trade-off is clearly a good 
one, as the best solutions are found with fewer than thirty 
modules. 

In Fig. 3, we also show three runs of the adaptive BRIM 
algorithm described in section HID. The lines show the 
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progress of the number of modules and modularity value 
during the search. The number of modules c allowed for 
the BRIM search is typically close (within 10%) to the 
number of modules actually found, suggesting that the 
adaptive approach eliminates a wasteful search through 
vertex assignments with too many modules. The three 
traces all show typical behavior and lead to good solu- 
tions; two of the adaptive runs lead to better solutions, in 
terms of modularity, than any of the much larger number 
of trials using BRIM with a fixed c. 
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FIG. 3: Quality of solutions found in the Scotland corporate 
interlock network. The modularity Q depends on the allowed 
number of modules c. The points correspond to solutions 
found using the BRIM algorithm starting from a random ini- 
tial assignment of vertices to modules. The values on the 
ordinate indicate the number of modules occupied by at least 
one vertex in the solution state found by the BRIM algo- 
rithm. All points are slightly dithered to better show regions 
with many similar or identical solutions. The lines show the 
course of an adaptive search for the correct number of mod- 
ules to maximize the modularity, terminating at states with 
the modularity and number of modules shown by the crosses. 

Based on the solutions shown in Fig. 3, the main com- 
ponent of the Scotland corporate interlock network has 
roughly twenty community groups, considerably fewer 
than the 131 directors or 86 firms. This analysis could 
serve as a starting point for an investigation of the com- 
munity structures of the firms or directors. A more com- 
prehensive analysis would take into account the available 
information on the corporate sectors and capital of the 
firms. 



V. CONCLUSIONS 

We have defined and explored a modularity appropri- 
ate for bipartite networks. The presented results ex- 



tend and specialize the matrix-based approach recently 
reported by Newman [19] for unipartite networks. The 
bipartite structure of the network is reflected mathemat- 
ically in the importance of an asymmetric submatrix B 
of the full bipartite modularity matrix B, with a corre- 
sponding emphasis on the singular value decomposition 
of B instead of the spectral decomposition of B. We 
made use of the properties of B to define an algorithm, 
BRIM, for use in identifying network modules. By ap- 
plying the algorithm to real-world networks, we demon- 
strated its effectiveness and identified some of its limita- 
tions. 

The usual unipartite modularity has a limited resolu- 
tion that depends on the number of edges in the net- 
work [35]. The main consequence of the resolution limit 
is that the modules in large networks may have hidden 
substructures that require deeper investigations to reveal. 
Although we have not shown it, we expect that the bi- 
partite modularity introduced in this work has a similar 
resolution limit, with similar consequences. 

One of the key themes in this paper has been that 
the bipartite structure of the network can be benefi- 
cially incorporated into its mathematical description and 
its computational treatment. This theme was realized 
in the BRIM algorithm, where the assignment of ver- 
tices to modules in one part of the network, when held 
fixed, provides a stable modularity landscape in which it 
is straightforward to partition the vertices of the other 
part into modules. We expect that the characteristics of 
other specialized classes of networks could be taken ad- 
vantage of in an analogous fashion to define appropriate 
null model networks, modularity measures, and commu- 
nity detection algorithms. 

The eigenvalues of the graph Laplacian are closely re- 
lated to many important properties and invariants of the 
graph [36]. In contrast, relatively little is known about 
the spectra of modularity matrices, be they for unipar- 
tite or bipartite networks. We are optimistic that the 
eigenvalues of the modularity matrix usefully relate to 
important and interesting network properties. 
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